MASSACHUSETTS INSTITUTE OF TECHNOLOGY 
ARTIFICIAL INTELLIGENCE LABORATORY 


A.I. Memo No. 815 

«!r 


December 1984 


Tense, Aspect and the Cognitive Representation of Time 

Kenneth Man-kam Yip 


ABSTRACT 

This paper explores the relationships between a computational theory of temporal representation 
(as developed by James Allen) and a formal linguistic theory of tense (as developed by Norbcrt 
Homstcin) and aspect. It aims to provide explicit answers to four fundamental questions: (1) what is the 
computational jusufication for the primitives of a linguistic theory; (2) what is the computational 
explanation of the formal grammatical constraints; (3) what are the processing constraints imposed on the 
learnability and markedness of these theoretical constructs; and (4) what are the constraints that a 
linguistic theory imposes on representations. We show that one can effectively exploit die interface 
between die language faculty and the cognitive faculties by using linguistic constraints to determine 
restrictions on the cognitive representations and vice versa. 

Three main results are obtained: (1) We derive an explanation of an observed grammatical 
constraint on tense -- die Linear Order Constraint -- from die information monotonicity property of the 
constraint propagation algorithm of Allen’s temporal system.: (2) We formulate a principle of markedness 
for the basic tense structures based on die computational efficiency of die temporal representations; and 
(3) We show Allen’s interval-based temporal system is not arbitrary, but it can be used to explain 
independently motivated linguistic constraints on tense and aspect interpretations. 

We also claim that the methodology of research developed in this study -- "cross-level" 
investigation of independently motivated formal grammatical theory and computational models -- is a 
powerful paradigm with which to attack representational problems in basic cognitive domains, e.g., space, 
time, causality, etc. 
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1. Objectives and Main Results 

One major effort in modern linguistics is to limit the class of possible grammars to those that are 
psychologically real. A grammar is psychologically real if it is (a) realizable - possessing a computational 
model that can reproduce certain psychological resource complexity measures, and (b) learnable - capable 
of being acquired (at least, in principle) despite the poor quality of input linguistic data, A shift of 
emphasis from the. pure characterization problem of grammar to the realization and learnability problems 
naturally brings linguistics closer to A1 work in natural language understanding concerned with 
computational models of language use and language acquisition. Computational study is in principle 
complementary to more formal and abstract grammatical theory. Each should contribute to the other. 

The purpose of this paper is to work out an example of how formal grammatical theory and 
computational models can effectively constrain each other’s representations. In particular, I seek to 
explore four fundamental issues: 

1. How is the choice of primitive structures in grammatical theory to be justified? 

2. What is the explanation of the rules and constraints that have to be stipulated at the 
grammatical level? 

3. How are these knowledge structures acquired? 

4. What are the theoretical constraints imposed by the grammar on the representational scheme of 
the computation theory? 


\Miat I hope to show is that structures and principles that have to be stipulated at the grammatical 
level fall out naturally as consequences of the properties of the algorithms and representations of the 
underlying computational model. In so doing, 1 will also restrict the class of plausible computational 
models to those that can explain or incorporate the constraints imposed by the formal grammatical 
theory. 

There are a number of requirements that must be met in order for such "cross-level" study to 
succeed. First, there is a sizable collection of facts and data from the target domain to be explained. 
Second, there is independent motivation for the theory of grammar — it is empirically adequate. And, 
third, the computational model is also independently motivated by being sufficiently expressive and 
computationally efficient. 

With these considerations, I have chosen two domains: (1) tense and (2) aspect. Tense concerns the 
chronological ordering of situations with respect to some reference moment, usually the moment of 
speech. Aspect is the study of situation types and perspectives from which a particular situation can be 
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vicwed or evaluated.-* The point of departure of this study is two papers: (1) for the theory of tense, 
Hornstein’s "Towards a theory of Tense" (Hornstein77) and (2) for the cognitive theory of time, James 
Allen's "Towards a General Theory of Action and Time" (Allen84). 

In the following, I shall list die main results of tliis study: 

1. A better theory of tense with revised primitive tense structures and constraints. 

2. We derive an explanation of Hornstcin’s Linear Order Constraint, an observed formal 
constraint on linguistic tense, from properties of the constraint propagation algorithm of 
Allen’s temporal system.. This shows this formal grammatical constraint need not be learned 
at all. We also show that the rule of R-permanence follows from the hypothesis that only the 
matrix clause and the subcategorizable SCOMP or VCOMP can introduce distinct S and R 
points. Finally, we prove that certain boundedness condition on the flow of information of a 
processing system leads directly to tire locality property of a constraint on sequences of tense. 

3. A principle of markedness for tense structures based on the computational efficiency of the 
temporal representation. The principle predicts that (1) of the six basic tenses in English, 
future perfect is the only marked tense, and (2) the notion of a distant future tense, just like the 
simple future, is also unmarked. 

4. A better account of the state/event/process distinction based on Allen’s interval-based 
temporal logic and the idea that the progressive aspect specifies the perspective from which the 
truth of a situation is evaluated. 

5. An account of theoretical constraints on the representation of time at the computational level, 
e.g., three distinct time points are necessary to characterize an elementary tensed sentence, and 
the distinction between instantaneous and non-instantaneous urne intervals. This shows that 
Allen's interval-based scheme is not arbitrary, but is motivated by independent linguistic 
constraints. 


1. This definition of aspect is slightly narrower than that accepted by most linguists. For instance, Comrie defined aspect as 
"different ways of viewing the internal temporal constituency of a situation". (Comrie76) 
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2. Tense 

We begin by first outlining Hornstein’s theory of tense. In section 2.1, we describe the primitives 
and constraints on tense of his theory. In sections 2.2 and 2.3, we show how the primitives and constraints 
can be derived from computational considerations. 

2.1 Revisions to Hornstein's Theory of Tense 


Hornstein develops a theory of tense within the Reichcnbachian framework which postulates three 
theoretical entities: S (the moment of speech), R (a reference point), and E (the moment of event). The 
key idea is that certain linear orderings of the three time points get grammaticalized into the six basic 
tenses of English. The following is the list of basic tense structures: 


1. SIMPLE PAST ' E,R_S 

2. PAST PERFECT E_R_S 

3. SIMPLE PRESENT S,R,E 

4. PRESENT PERFECT E_S,R 

5. SIMPLE FUTURE S_R,E 

6. FUTURE PERFECT S_E_R 

The notation here demands some explanation. The underscore symbol is interpreted as the 
"less-than" relation among time points whereas the comma symbol stands for the 
’’less-than-or-equal-to" relation. As an illustration, the present perfect tense denotes a situation in which 
the moment of speech is either cotcmporaneous or precedes the reference point, while the moment of 
event is strictly before the other two moments. Note that Hornstein also uses the term "association" to 
refer to the comma symbol",". 


Given the basic tense structure for a simple tensed sentence, the interpretation of the sentence that 
arises from the interaction of tense and time adverbs is represented by tine modification of the position of 
the R or E points to form a new tense structure which we call a derived lease structure. In two papers 
(Homstein77 & HornsteinSl), Hornstein proposes three formal constraints that limit the class of derived 
tense structures that can be generated from the basic tense structures in such a way as to capture the 
acceptability of sentences containing temporal adverbs (e.g., now, yesterday, tomorrow), temporal 
connectives (e.g., when, before, after), and indirect speech. In the rest of this section, 1 shall examine the 
adequacy of these constraints. 


. 2. Hornstein actually listed nine basic tenses, but I think the progressive belongs to the province of aspect rather than tense. 




5- 


2.1.1 Linear Order Constraint 

The Linear Order Constraint (L.OC) states that (p.523-4): 

(1) The linear order of a derived tense structure must be the same as the linear order of the basic 
structure. 

(2) No new association is produced in the derived tense structure. 

LOC is stipulated to account for examples consisting of a single temporal adverb such as (4a) and 
those with two time adverbs such as (32).^ 

4a. John came home i. “now, at this very moment 
ii. yesterday 
iii. “tomorrow 

32 a. John left a week ago [from] yesterday. 

b. [From] Yesterday, John left a week ago. 

c. “A week ago, John left [from] yesterday. 

The basic tense structure for 4(ai) is: 

E,R_S (simple past: John came home) 

Now modifies E or R so that they become cotemporaneous with the moment of speech S with the derived 
tense structure as follows: - 

E,R,S (BAD: violates LOC since new association is produced) 

On tire other hand, 4(aii) is acceptable because of the permissible derived tense structure: 

yesterday 

E.R S —> E,R S (OK: does not violate LOC) 

The crucial example, however, is 5(c):^ 

5c. John, has come home i. ?right now 

ii. “tomorrow 
iii. yesterday. 


3. The numberings are Hornstein’s. 

4. See footnote 7 and 11 of Hornstein’s paper. 
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LOC predicts (wrongly) that 5cii is good and 5ciii bad.'' But LOC gives the wrong prediction only on the 
assumption that the basic tense structures arc correct. To account for 5c, I propose to save die LOC and 
change the following SRE association with the present perfect: 

PRESENT PERFECT E_R,S 

With the modified basic tense structure for present perfect, LOC will give the correct analysis. 5cii is bad 
because: 

tomorrow Cb 

E_R,S —► E_S_R (linear order violated) 

5ciii is acceptable since: 

yesterday 

E_R,S —> E_R_S (OK since no new linear order 

and no comma introduced) 

The proposed modification also accounts for the unacccptability of sentence (48b) for which Hornstein 
has to appeal to some sort of Gricean maxims of conversation. 

48b, *[From] Tomorrow John has climbed Mt. Everest three times (in the 
past/last year). 

The analysis of 48b is:' 

tomorrow,1ast year 

E_R,S —* E_S_R (linear order is not maintained) 

The question that naturally arises at this point is: Why does Hornstein not choose my proposed SRE 
structure for the present perfect? The answer, I believe, will become apparent when we examine 
Hornstein’s second constraint. 

2.1.2 Rule for Temporal Connectives 

The rule for temporal connectives (RTC) states that (p.539-40): 

For a sentence of the form P^-conn^, where "conn" is a temporal connective such as "when", "before", 
"after" etc., line up the S points of P^ and Pj, that is, write the tense structure of Pj and P 2 , lining up the 

5. There may be doubts as regards the acceptability of 5ciii, An equivalent form of Sciii is acceptable is Danish (Jespersen65, 
p.271). Also, in French, the present perfect can be used for a situation that held not more than 24 hours before the present moment 
(Comrie76, p,61). 
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S points.^ Move R 2 to under R|, placing E 2 accordingly to preserve LOC on the basic tense structure. 

It can be easily seen that my proposed tense structure for present perfect does not work with RTC since it 
produces tire wrong predictions for the following two sentences: 

[1] *John came when we have arrived. 

[2] John comes when we have arrived. 

For [1] the new analysis is: 

E, R_S -♦ E, R_S 

I I 

E_R, S E_R_S 

which does not violate the RTC and hence-predicts (wrongly) that [1] is acceptable. Similarly, for [2], the 
new analysis is: 

S,R,E —► S.R.E (violates RTC) 

I I 

E_R,S E_S, R 

which predicts (wrongly) that [2] is bad. 

This may explain why Hornstein decides to use E_S,R for the present perfect because it can 
account for [1] and [2] with no difficulty. However, I suggest that the correct move should be to abandon 
RTC which has an asymmetrical property, i.e., it matters whether or P-> is put on top, and does not 
have an obvious semantic explanation. (See Homstein’s footnote 20, p.543). My second proposal is then 
to replace RTC with a Rule of R-pcrmanence (RP) stating that: 

(RP): Both the S and R points of P^ and P 2 must be aligned without any manipulation of the tense 
structure for P 2 . 

Thus sentence [3]: 

[3] John came when we had arrived. 

is acceptable because its tense structure does not violate RP: 


6. There is a typographic error in Homstein's paper where it is mistakenly stated as "lining up the Rs". 



E,R_S (OK: S and R points are already aligned) 

- E_R_S 

Now, let us reconsider sentences [1] and [2], Sentence [1] is not acceptable under RP and the new tense 
structure for present perfect since: 

E,R_S (violates RP: the two R's are not aligned) 

E_R,S 

Sentence [2] is still a problem. Here I shall make my third proposal, namely, that the simple present 
admits two basic tense structures: 

SIMPLE PRESENT S,R,E and E,R,S 

Given this modification, sentence [2] will now be acceptable since: 

E,R,S (S and R points are aligned) 

E_R, S 

To examine the adequacy of RP, let us look at more examples: 

[4] John has come when i. *we arrived 

ii. *we had arrived 

iii. we arrive 

iv. we have arrived 

v. *we will arrive 

The corresponding analysis is as follows: 


[4'] i. E_R, S 

E , R_S 

(BAD) 

ii. E_R,S 

E_R_S 

(BAD) 

iii. E_R,S 

E.R.S 

(OK) 

iv. E_R,S 

E_R,S 

. (OK) 

v. E_R, S 

S_R, E 

(BAD) 
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We can see Lhat the proposed theory correctly predicts all of the five cases. There is, however, an apparent 
counter-example to RP which, unlike RTC, is symmetrical, i.e., it does not matter which of the Pj's is put 
on the top. Consider tire following two sentences: 

[5] i. John will come when we arrive, 
ii. *John arrives when we will come. 

RP predicts both 5i and 5ii will be unacceptable, but 5i seems to be good. It is examples like 5i and 5ii, I 
believe, that lead Hornstcin to propose the asymmetrical rule RTC. But I drink die data arc misleading 
because it seems to be an idiosyncrasy of English grammar that 5i is acceptable. In French, we have to 
say an equivalent of "John will come when we will arrive" with the temporal adverbial explicidy marked 
with the future tense (Jespersen65, p.264). Thus, the acceptability of sentences like 5i can be explained by 
a principle of Economy of Speech allowing us to omit die future tense of the temporal adverbial if the 
matrix clause is already marked with the future tense. The same kind of explanation also works for 
examples 5iii and 5iv: 

[5] iii. John will come when we have arrived. 

iv. *John has come when we will arrive, 

where 5iii is really short for "John will come when we will have arrived" which is acceptable. 

2.1.3 Sequences of Tense 

Now, we describe the third and final grammatical constraint on sequences of tense. Consider the 
following sentences: 

[6] John said a week ago that Mary.(a) will leave in 3 days. 

(b) would 

[7] John said yesterday that Mary (a) left 3 days ago. 

(b) had left 

In the (a) sentences, the temporal interpretation of the embedded sentence is evaluated with respect to the 
moment of speech. Thus, for instance, [6a] means that Mary’s leaving is 3 days after present moment of 
speech. On the other hand, the (b) sentences have the interpretations evaluated with respect to the 
interpretations of the matrix clause. That is to say, [6b] means that Mary’s leaving is 4 days before the 
moment of speech. 

To account for the sequence of tense in reported speech, Hornstein proposes the following rule: 
(SOT): For a sentence of the form "P^ that P 2 ", assign S 2 with E^ 

In general, for an n-level embedded sentence, SOT states that: assign S n with E ^ (HornsteinSl, p.140). 



- 10 - 


With the SOT rule, [ 6 a] and [ 6 b] will be analyzed as follows: 

[6a'] a week ago 

I 

E l- R l— S 1 

S 2 _ R 2 ,E 2 = = > e 2 ^ days after Sj 

I 

in three days 

[6b'] a week ago 

I • 

Ej.Rj _Sj 

I 

s 2 _R 2 »==> e 2 is 4 days before s i 

I 

in three days 


The local property of SOT, i.e., linking occurs only between nth and (n-l)th level, has a nice 
consequence: it explains why a third level nested sentence like [ 8 ]: 

[8] John said a week ago (a) 

that Harry would believe in 3 days (b) 

that Mary (i) will leave for London in 2 days (c) 

(ii) would 

has only two temporal readings: (1) in 8 (ci). Mary’s leaving is two days after the moment of speech, and 
(2) in 8 (cii), Mary’s leaving is two days before the moment of speech. In parucular, there is not a 
temporal reading corresponding to the situation in which Mary’s leaving is five days before the moment 
of speech. We would obtain the third reading if SOT allowed non-local linking, e.g., assigned S 3 with E 3 . 

2.2 Explanations of the Formal Constraints 

In the previous section, we have examined three formal constraints on the derivation of complex 
tense structures from the basic tense structures: (1) LOC, (2) RP, and (3) SOT. Now, I want to show how 
the LOC falls out naturally from the computational properties of a temporal reasoning system along the 
line suggested by Allen (Allen84, Allen83), and also how the RP and SOT constraints have intuitive 
computational motivation. 

The basis of Allen’s computational system is a temporal logic based on intervals instead of dme 
points. The temporal logic consists of seven basic relauons and their inverses (Allen84, p.129, figure 1): 



-11 - 


Re 1 at ion symbol symbol for inverse meanina 

> XXX YYY 

= XXX 

YYY 

mi XXXYYY 

oi XXX 

YYY 

di XXX 

YYYYY 

Si XXX 

YYYY 

fi XXX 

YYYY 

The reasoning scheme is a form of constraint propagation in a network of event nodes linked by temporal 
relationships. For instance, the situation as described in the sentence "John arrived when we came" is 
represented by the network: 

A — (> < m mi = ) —> B 

\ 

( < ) 

NOW 

where A = John's arrival and B = Our coming 

This network means that both event A and event B are before now, the moment of speech, while A can be 
before, after or simultaneous with B. 

When new temporal relationships are added, the system maintains consistency among events by 
propagating the effects of the new relationships via a Table of Transitivity Relationships that tells the 
system how to deduce the set of admissible relationships between events A and C given the relationships 
between A and B, and between B and C. Thus, for instance, from the relationships "A during B" and "B 
< C", the system can deduce "A < C". 




( < ) 


X before Y < 

X equal Y = 

X meets Y m 

X overlaps Y o 

X during Y d 

X starts Y s 

X finishes Y f 


One property of the constraint propagation algorithm generally is that further information only 
causes removal of members from the set of admissible labels, i.e., temporal relationships, between any 
two old events (Allen83, p.835). No new label can be added to the admissible set once it is created. Let us 
call this property of the constraint propagation algorithm the Delete Label Condition (DLC). DLC can 
be interpreted as a kind of information monotonicity condition on the temporal representation. 
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Let us further restrict Allen’s temporal logic to instantaneous intervals, i.e.. each .event corresponds 
to a single moment of time. The restricted logic has only one primitive relation, <, and three other 
derived relations: <, >, and >. There is a straightforward translation ofHornstein's SRE notation into the 
network representation, namely, replace each comma symbol by < (or > with the event symbols 
reverse their roles) and each underscore symbol by > (or < with similar adjustment on the event 
symbols). Thus, a tense structure such as: E_R,S can be represented as: 


(> =) (>) 

V/ 

R 

With this representation scheme, we can prove the following theorem: 

(Tl) DLC -4 LOC 
Proof 

Let A and B range over { S, R, E } and A * B, There are five basic types of violations of the LOC: 

1. A_B —♦ B_A 

2. A_B —> A,B 

3. A_B —► B,A 

4. A,B —> B,A 

5. A,B -» B_A 

We can see that each of these cases is a violation of the DLC, To spell this out, we have the following 
operations on the constraint network corresponding to the above violations of the LOC: 

l’.A-(<)->B -* A-(>)->B 
2’. A -(<)-> B —♦ A -(< = )->B 
3’. A -(< )-> B -*A-(> = )->B 
4’. A -(< = )-> B -4 A -(> = )->B 
5’. A -(< = )-> B —> A -( >)->B 

In each of these cases, the operation involves the addition of new members to the admissible set This is 
ruled out by DLC. Thus, we have the result that if LOC is violated, then DLC is violated. In other 
words, DLC —+ LOc7 H - 


7. The converse of this theorem is not true, i.e„ there are cases in which DLC is violated but the LOC is still obeyed. 
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Thc second constraint to be accounted for is die RP which effectively suites tltat (a) the S points of 
the matrix clause and die temporal adverbial must be identical, and (b) the R points of die matrix clause 
and die temporal adverbial must be identical. One hypothesis for this rule is that: 

(HI) Only the matrix clause introduces distinct S and R points. 

0 

In other words, die non-subcategorizable temporal adjuncts do not add new S and R points.. 

HI has to be modified slightly to take the case of embedded sentence into account, namely, 

0 

(Revised RP): Only the matrix clause and the subcategorizable SCOMP or VCOMP can introduce 
distinct S and R points. 

where SCOMP and VCOMP stand for sentential complement and verbal complement respectively. The 
interesting point is that both the revised RP and the locality property of SOT can be easily implemented 
in processing systems which have certain boundedness constraint on the phrase structure rules (e.g., 
information cannot move across more dian one bounding node). To illustrate this, let us consider the 
following tense interpretation rules embedded in the phrase structure rules of the Lexical-Functional 
Grammar (Kaplan82): 

S —+ NP VP 
(4 S-POINT) = NOW 

VP —+ V (NP) (ADVP) (S') 

(4 S-POINT) =(( T E-POINT) if (4 tense) = PAST 
"[NOW otherwise 

ADVP -4 Adv S 
S' COMP S 
Adv —♦ when 

( T T-REL) = { <,>,.* ,m,mi } . 

before 

( T T-REL) = { > > 

The S rule introduces a new S point and sets its value to now. The VP rule has two effects: (1) it does not 
introduce new S or R points for the temporal adverbial phrase, thus implicitly incorporating the revised 
RP rule, and (2) it looks at the tense of the embedded sentential complement, setting the value of its S 
point to that of the E point of the higher clause if the tense is pasl, and to now, otherwise. Thus, in this 
way, the second effect accomplishes what the SOT rule demands. 


8. These are temporal adverbs which can have multiple occurrences, can be fronted, and are omissible without affecting the 
meaning of the matrix clause. ■ 



2.3 Implications for Learning 


If the revisions to Hornstcin’s theory of tense are correct, the natural question to be asked is: How 
do speakers attain such knowledge? This question has two parts: (1) How do speakers acquire the formal 
constraints on SRE derivation? and (2) How do speakers learn to associate the appropriate SRE structures 
with the basic tenses of the language? 

Let us consider the first sub-question. In the case of LOC, we have a neat answer -- the constraint 
need NOT be learned at all! We have shown that LOC falls out naturally as a consequence of the 
architecture and processing algorithm of the computational system. As regards the constraint RP, the 
learner has to acquire something similar to HI. But HI is a fairly simple hypothesis that does not seem to 
require induction on extensive linguistic data. Finally, as we have shown in the previous section, the 
boundedness of the flow of information of a processing system (as, for instance, illustrated in the phrase 
structure rules of the Lexical "Functional Grammar) leads directly to the locality property of the SOT. 
The particular linking of S and E points as stipulated by the SOT, however, is a parameter of the 
Universal Grammar that has to be fixed. 

What about the second sub-question? How do speakers learn to pair SRE configurations with the 
basic tenses? There are 24 possible SRE configurations seven of which get grammaticalized. Here I want 
to propose a principle of markedness of SRE structures that has a natural computational motivation. 

Let us recall our restrictive temporal logic of instantaneous interval with one primitive relation, <, 
and three derived relations: <,>, and >. Represent a SRE configuration as follows: 

S -> E 

V 

The admissible labels are among {<,< = ,>,>=}. So there are altogether 64 possible configurations that 
can be classified into three types: 

(1) Inconsistent labelings (16), e.g., 
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(2) Labelings that do not constrain the SE link given the labelings of SR 
and RE (32), e.g.: 

S -(?)-> E 

\ 7 1 

«) (>) 

V 

(3) Labelings that are consistent and the SE link is constrained by the 
SR and RE link ( 16) , e.g., 

S -(<)-> E 

\ T 

(<) (<) 

V 

If we assume that labelings of the third type correspond to the unmarked SRE configurations, the 
following division of unmarked and marked configurations is obtained: 


UNMARKED 


MARKED 

E_R_S 

PAST PERFECT 

E_S_R 

E , R_S 

SIMPLE PAST 

E , S_R 

E_R, S 

PRESENT PERFECT 

E_S, R 

E, R, S 

SIMPLE PRESENT 

E , S, R 

S, R, E 

SIMPLE PRESENT 

S_E_R FUTURE PERFECT 

S, R_E 


S_E , R 

S_R, E 

SIMPLE FUTURE 

S, E_R 

S_R_E 


S,E,R 

R_S_E 

R_S,E 

R_E_S 

R_E,S 

R, E_S 

R, S_E 


R.E.S 

R.S.E 

The interesting result is that five out of the six basic tenses have unmarked SRE configurations. 
This agrees largely with our pretheorctical intuition that the SRE configurations that correspond to the 
basic tenses should be more "unmarked" than other possible SRE configurations. The fit, however, is not 
exact because the future perfect tense becomes the marked tense in this classification. 
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Anothcr prediction by this principle of markedness is drat both the simple future (S_R.E) and 

distant future (S_R_E) arc unmarked. It would be interesting to find out whether there are languages 

in which the distant future actually gets grammaticalizcd. 

The final point to be made is about the second type of labelings. There are two other possible ways 
of grouping the labelings: (1) given SR and SE, those labelings in which RE is constrained, and (2) given 
SE and RE, those in which SR is constrained. But these types of grouping are less likely because they 
would yield the simple present tense as a marked tense. Thus, they can be ruled ou: b;> relatively few 
linguistic data. 

2.4 Implications for Cognitive Representation 

I have already touched on the cognitive representation of time in various places in the previous 
sections. Here I just want to reiterate two important points. First, the computational constraint to 
maintain consistency of temporal relationships has theoretical significance inasmuch as the formal 
constraint LOC follows directly from the processing characteristics of the constraint propagation 
algorithm and the representation method. 

Second, the grammatical theory requires the representation of three different moments of time for a 
tensed sentence. However, as far as the problem of tense is concerned, a temporal structure <T,<> with 
linear ordering of time points will suffice. To make use of the full power of Allen’s temporal logic, we 
have to consider the problem of verb aspect. That is the topic of the next few sections. 
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3. Verb Aspect 

The two main problems of the study of verb aspect arc the correct characterization of (1) the three 
fundamental types of verb predication according to the situation types that they signify -- state, process 
and event, and (2) the perspectives from which a situation is viewed, or its truth evaluated -- simple or 
progressive. ^ In the first part of his paper, Allen attempts to provide a formal account of the 
state/process/event distinction using a temporal logic. However, 1 believe that his characterization fails 
to capture well-known patterns of tense implications, and does not make the distinction between situation 
types and perspective types fundamental to any adequate account of verb aspect. In the next section, I 
will present some data that any theory of verb aspect must be able to explain. 

3.1 Data 

3.1.1 Tense Implications 

1. Statives rarely take the progressive aspect 9 10 , e.g., 

I know the answer. 

*1 am knowing the answer. 

2. For verb predications denoting processes, the progressive of the verb form entails the perfect form, i.e., 

x is V-ing x has V-ed, 

For instance, 

John is walking —> John has walked. 

3. For verb predications denoting events, the progressive of the verb form entails the negation of the 
perfect form, i.e., 

x is V-ing —* x has not V-ed. 

For instance, 

John is building a house —► John has not built the house. 


9. Some of the better works in the literature are: Vendler67, Kenny63, Comrie76, Mourelatos78. 

10. It has often been pointed out that some statives do take the progressive form. Eg., "I am thinking about the exam.", "The 
doctor is seeing a patient" However, a concrete piece of evidence is provided by Ota in his study of tense and aspect usage 
involving written material and ordinary conversation which shows that there is a class of verb, including familiar statives such as 
"believe", "think", "see", "know", that rarely occurs with the progressive aspect-- less than 2 % of the total occurrences of the verb. 
(Ota63, section 2.2) 


3.1.2 Sentences containing When 



Sentences containing clauses connected by a connective such as "when" have different aspect 
interpretations depending on the situation types and perspective types involved. 

[9] John laughed when Mary drew a circle. 

Situation/Perspective type: X = process/simple; Y = event/simple 
Interpretation: X can be before, after or simultaneous with Y 

[10] John was laughing when Mary drew a circle. 

Situation/Perspective type: X = process/progressive; Y = event/simple 
Interpretation: Y occurs during X. 

[11] John was angry when Mary drew a circle. 

Situation/Perspective type: X = state/simple; Y = event/simple 
Interpretation: X can be before, after, simultaneous with or during Y. 

[12] John was laughing when Mary was drawing a circle. 

Situation/Perspective type: X = process/progressive; Y = event/progressive 
Interpretation: X must be simultaneous with Y. 

3.1.3 Sentences containing Before/After 

When the temporal connective is "before" or "after”, the adverbial clause cannot contain the 
progressive aspect. 

[13] John laughed before/after Mary painted the wall. 

[14] John laughed *before/*after Mary was paindng the wall. 

[15] John laughed before/aftcr Mary was angry, 

3.1.4 Event Verb with Present Perfect Tense 

An event verb with present perfect tense cannot have a continuative aspect reading: 

[16] He has lived in Cambridge for the past 3 months. (CONTINUATIVE) 

[17] He has sailed in the sea for the past 3 months. (CONTINUATIVE) 

[18] He has asked this question for the past 3 months. (‘CONTINUATIVE) 
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3.2 Formal Account of the State/Proccss/Kvent distinction 
Define: 

(a) XCY^XdYVXsYVXfY 

(b) XCY«XCYVX equal Y 

(c) mom(t) •«-»• t is an instantaneous interval, i.e„ consists of a single moment of time 

(d) pcr(t) «-*• t is a non-instantaneous interval^ 

where X and Y are generic symbols denoting state, event or process. 

3.2.1 Progressive 

(PROG): OCCUR(PROG(v,t)) ~ mom(t) A -i OCCUR(v.t) A (3 f)(t df A OCCUR(v,t’)) 

The progressive aspect is the evaluation of a situation from an interior point t of the situation which has 
the property that though the sentence is not true at that instantaneous interval, it is true in a 
non-instantaneous interval f properly containing t. 

3.2.2 State 

(SI): OCCUR(s,t) ♦♦ (V t’)(mom(t’) A t’ C t -» OCCUR(s,t’)) 

A state verb is true at every instantaneous interval of t. The definition is similar to Allen's H.l (Allen84, 
P-130). 

The following theorem shows that state verbs do not occur with the progressive aspect. 

(S-THEOREM): *OCCUR(PROG(s,t)) 

Proof 

OCCUR(PROG(s,t)) ~ mom(t) A -« OCCUR(s,t) A (3 t’)(t d t’ A OCCUR(s,f)) 

—t OCCUR(s,t’) for some f containing t 
OCCUR(s,t) (by SI) 

.'. contradiction. H 


11. This section benefits from the insights of Barry Taylor (Taylor77). 
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This theorem raises the following question: Why do some statives occur with the progressive? I 
think there are two answers. First, the verb in question may have a use other titan the stative use (e.g. 
"have" is a stative when it means "possession”, and not a stative when it means "experiencing" as in 
"John is having a good time in Paris.”) Second, the English progressive may have a second meaning in 
addition to that characterized by PROG above. A frequent usage of the progressive is to indicate short 
duration or temporariness, e.g., in "They arc living in Cambridge’VThey live in Cambridge". 

3.2.3 Process 

A process verb can be true only at an interval larger than a single moment. This property differs crucially 
from that of the statives. 

(PI): OCCUR(p.t) -»per(t) 

(P2): OCCUR(p.t) - (V t’)(per(t’) A t’ C t -+ OCCUR(p,t’)) 

The following theorem shows that for a process verb, the progressive verb form entails tile perfect form. ' 
(P-THEOREM) OCCUR(PROG(p,t)) (3 t’)(per(t’) A t’< t A OCCUR(p,t’)) 

Proof 

OCCUR(PROG(p,t)) -» mom(t) A -i OCCUR(p.t) A (3 t’)(t d t’ A OCCUR(p,t’)) 

—» OCCUR(p.t’) for some t’ such that t d t’ 

-> 3 m^ € t\ m^ < t (since t d t’) 

—> 3 € t’. m^ < n^ < t (by density of time points) 

Let t" be the interval [m^n^]. Then, we have t" < t and t" C t’. By (P2), we have OCCUR(p,t"). That is, 
p has occurred. —1. 

The characterization of process verb by Allen (his 0.2) is less satisfactory because it combines both 
the notion of progressive aspect (his "OCCURRING") and the process verb into the same axiom. 
Furthermore, the difference between the predicate "OCCUR" and "OCCURRING" is not adequately 
explained in his paper. 

3.2.4 Event 

An event verb shares an important property with a process verb, namely, it can be true only at a 
non-instantaneous interval. 

(El): OCCUR(e,t) -> per(t) 

(E2): OCCUR(e,t) - (V t’)(per(t*) AfCt-*i OCCUR(e,t’) 

The following theorem shows that the progressive form of an event verb entails the negation of the 
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pcrfcct form. 

(E-THEOREM): OCCUR(PROG(c,t)) -+ ->(3 t’)(pcr(t’) A t'< t A OCCUR(e,t*)) 
Proof 


As in the proof of (P-THEOREM), we can find a non-instantaneous interval t" such that t" < t and t" C 
t’. But for any such t", we have -i OCCUR(e,t”) because of (E2). That is, it cannot be the case that e has 
occurred. H. 

Again the crucial property (El) is not captured by Allen’s characterization of events (his 0.1). 

3.3 Constraint on temporal interpretations involving When 

To account for the variety of aspect interpretations as presented in section 3.1.2, I propose the 
following constraint on situation/perspective type: 

(C-ASPECT): Let "dynamic" stand for a process or event. 

(a) simple/dynamic -+mom(t) 

(b) simple/state —► per(t) 

(c) progressive/dynamic —► per(t) A C 

Perspective is a way of looking at the situation type. For process or event, the simple aspect treats the 
situation as an instantaneous interval even though the situation itself may not be instantaneous. For state, 
the simple aspect retains its duration. The progressive aspect essendally views a process or event from its 
interior,.thus requiring a stance in which the situation is a non-instantaneous interval and the admissible 
temporal relationship to be die C relations, i.e., 4 d, di, equal. 

Let me show graphically how C-ASPECT accounts for the aspect interpretations of sentences [9] to 

[ 12 ]. 

[9'] simp!e/process WHEN simple/event 

Admissible relations: 

< m = mi > 

X Y XY X YX Y X 

Y 
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[10'] progressive/process WHEN simple/event 

Admissible relations: 

si di fi 

XXX XXX XXX 

Y Y Y 

[11'] simple/state WHEN simple/event 

Admissible relations: 

> mi si di fi m < 

Y XXX YXXX XXX XXX XXX XXXY XXX Y 

Y Y Y 

[12'] progressive/process WHEN progressive/event 
Admissible relations: 


= 

f 

fi 

s 

si 

d 

di 

XXX 

XXX 

XXXX 

XXX 

XXXX 

XX 

XXXX 

YYY 

YYYY 

YYY 

YYYY 

YYY 

YYYY 

YY 


3.4 Constraint on temporal interpretations involving Beforc/After 

The constraint C-ASPECT also accounts for the fact that the temporal adverbial headed by 
"before" or "after" cannot contain the progressive aspect. Consider the analysis of [14]: 

[14] simple/process BEFORE/AFTER progressive/event 

C-ASPECT allows three temporal interpretations for the sentence, namely, 

s d f 

X X X 

YYY YYY YYY 

But the temporal connective "before" forces the interpretation X < Y whereas "after" forces X > Y. None 
of these interpretations is consistent with the admissible set allowed by C-ASPECT. Therefore, [14] is 
bad. On the other hand, sentences [13] and [15] will be acceptable since the interpretations forced by the 

temporal connectives are compatible with those admitted by C-ASPECT. 
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3.5 Constraints on Event Verbs in the Present Perfect Tense 
Consider the following contrast: 

[19] John has sailed in the sea (a) for the last 3 months 

(b) in ,• . 

[20] John has been sailing in the sea (a) for the last 3 months 

(b) ?in 

In [19] the present perfect tense combines equally well with the adverbial "for the last 3 months" and "in 
the last three months". On the other hand, [20b] is much less acceptable titan [20a]. This can be 
accounted for by the difference in truth conditions involving "for" and "in": 

(FOR): 0CCUR(v,FOR T) -+ (V t)(t C T -+ OCCUR(v.t)) 

(IN): 0CCUR(v,IN T) -4 (3 t)(t C T A OCCUR(v.t)) 

Intuitively, if a situation occurs for an interval T, then the situation continues or lasts the whole period. 
But if a situation occurs in an interval T, it indicates there is an interval t properly contained in T during 
which the situation takes place. This explains why the present perfect progressive (P.P) does not occur 
with adverbials of the type "in the last n months/years” because the P.P requires a continuous reading 
whereas the adverbial implies the situation is completed. 

Now, let us look at: 

[18] John has asked this question for the last 3 months. 


From (E-THEOREM), we see that for an event verb, the present perfect form implies here is a definite 
time interval before now during which the event is completed, and thus it is incompatible witli the 
present progressive form. That is to say, if x has V-ed, then x is not V-ing now. So, the event can neither 
continue nor last the whole period as demanded by the semantics of the "for-adverbial", thus ruling out 
the continuative reading of [18]. 

3.6 Implications for Learning 

What is a possible aspect? Unlike tense, aspect is not reducible to purely temporal considerations. 
For instance, the semantic properties of state/process/event naturally call for a two dimensional analysis 
of heir structures - a temporal axis and an activity axis. A state has'temporal duration but is constant 
wih respect to he activity axis. Both process and event vary wih he activity axis, but hey differ in one 
crucial aspect: an event is bounded on he activity axis, i.e., approaching or resulting in a definite goal, 
while a process is unbounded. If his explanation of he situation type distinction is correct, hen learning 
his distinction will involve two binary choices: [± activity] and [± goal]. To spell his out, we have: 
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process 
event 
state 

IMPOSSIBLE 

Given tliis picture of situation types, the perspective aspect can be interpreted as the selection of a 
point on the temporal axis from which a situation is viewed. Two stances are possible: one inside the 
duration of a situation and the other outside. These two perspectives correspond to the progressive and 
simple aspect that we find in English. Notice that there is no need to further distinguish the outside 
stance into, say, the before-the-situation-duration and aftcr-the-situation-duration stance. Such a 
distinction can be deduced from the combined effect of tense and simple aspect. 

There is a third kind of aspect that I have not discussed in this paper. Both the situation type and 
perspective aspect can be drought as topological properties, e.g., boundedness on the activity axis. It is 
possible for a language to grammaticalize metrical properties such as die length of die duration in which a 
situation occurs, or die frequency of occurrence of a situation over a period of time. At present, however, 
I have no theory about this type of aspect. 

3.7 Implications for Cognitive Representation 

We have seen how Allen’s interval-based temporal logic can be used to capture a variety of 
semandc properties of situation types and perspective types. The disunction between instantaneous and 
non-instantaneous intervals turns out to play an important role in defining die progressive aspect, the 
three situation types and die constraint C-ASPECT. The latter constraint in particular reveals the role of 
a representation scheme in explaining phenomena that exist on a entirely different -- linguistic -- level. 


activity 

+ 

+ 


goal 
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4. Conclusion 

In this paper, I have examined two problems regarding linguistic semantics: tense and aspect. 
Important relationships between abstract constraints governing linguistic behavior and a computational 
scheme to reason about temporal relationships are discussed. In particular, 1 have shown that certain 
formal constraints, such as the Linear Order Constraint on tense, fall out naturally as a consequence of 
some computational assumptions. An interesting result is that such formal constraints need not be 
learned at all. The study of linguistic semantics also sheds light on a representation of time by revealing 
the fundamental distinctions that must be made, e.g., a tensed sentence involves three distinct time points, 
and the aspectual interpretations require instantaneous/non-instantaneous interval distinction. 

Future problems to study include: 

1. The computational architecture of die temporal reasoning system to show how learning of tense 
is possible. 

2. The adequacy and computational explanation of the C-ASPECT constraint. 

3. What is a possible aspect? How learning of aspect is possible? 

4. Apply the methodology to other cognitive domains, e.g., space, causality etc. 
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